A formant vocoder based on mixtures of Gaussians

نویسندگان

  • Parham Zolfaghari
  • Tony Robinson
چکیده

Parham Zolfaghari Tony Robinson Cambridge University Engineering Department, Trumpington Street, Cambridge CB2 1PZ, UK. Tel: [+44] 1223 332754 Fax: [+44] 1223 332662 email : psz1000,[email protected] ABSTRACT This paper describes a new low bit-rate formant vocoder. The formant parameters are represented by Gaussian mixture distributions, which are estimated from the discrete Fourier transform (DFT) magnitude spectrum of the speech signal [12]. A voiced/unvoiced classi cation mechanism has been developed based on the harmonic nature of each formant in the DFT spectrum modulated by the Gaussian Mixture distribution. Using a magnitude-only sinusoidal synthesiser [8], intelligible synthetic speech has been obtained. Vector quantisation [3] of the vocal tract parameters enables this formant vocoder to operate at a bit-rate of 1248 bps.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A segmental formant vocoder based on linearly varying mixture of Gaussians

MIXTURE OF GAUSSIANS Parham Zolfaghari and Tony Robinson Cambridge University Engineering Department, Trumpington Street, Cambridge CB2 1PZ, UK. Tel: [+44] 1223 332754 Fax: [+44] 1223 332662 email : psz1000,[email protected] ABSTRACT This paper describes a low bit-rate segmental formant vocoder. The formants are estimated using mixture of Gaussians whose means are constrained to vary linearly w...

متن کامل

Formant analysis using mixtures of Gaussians

This paper describes a new formant analysis technique whereby the formant parameters are represented in the form of Gaussian mixture distributions. These are estimated from the Discrete Fourier Transform (DFT) magnitude spectrum of the speech signal. The parameters obtained are the means, variances and the masses of the density functions, which are used to calculate centre frequencies, bandwidt...

متن کامل

Application of speaker modification techniques to phonetic vocoding

The goal of the work described in this paper is to develop a very low bit rate vocoding scheme. The vocoder is a typical LPC vocoder, whose parameters are post-processed on a phone-byphone basis, resulting in a variable bit rate segment vocoder. Given the well known speaker recognizability problems presented by vocoders at such low bit rates, we have attempted to integrate a speaker modificatio...

متن کامل

Real Time Pitch Shifting with Formant Structure Preservation Using the Phase Vocoder

Pitch shifting in speech is presented based on the use of the phase vocoder in combination with spectral whitening and envelope reconstruction, applied respectively before and after the transformation. A band preservation technique is introduced to contain quality degradation when downscaling the pitch. The transposition ratio is fixed in advance by selecting analysis and synthesis window sizes...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1997